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Abstract — In this work, we consider tlie minimum distance 
properties and convergence tliresliolds of 3-dimensionaI turbo 
codes (3D-TCs), recently introduced by Berrou et al.. Here, we 
consider binary 3D-TCs wliile tlie original work of Berrou et al. 
considered double-binary codes. In the first part of the paper, 
the minimum distance properties are analyzed from an ensemble 
perspective, both in the finite-length regime and in the asymp- 
totic case of large block lengths. In particular, we analyze the 
asymptotic weight distribution of 3D-TCs and show numerically 
that their typical minimum distance rfmin may, depending on 
the specific parameters, asymptotically grow linearly with the 
block length, i.e., the 3D-TC ensemble is asymptotically good for 
some parameters. In the second part of the paper, we derive 
some useful upper bounds on the dmin when using quadratic 
permutation polynomial (QPP) interleavers with a quadratic 
inverse. Furthermore, we give examples of interleaver lengths 
where an upper bound appears to be tight. The best codes (in 
terms of estimated dmin) obtained by randomly searching for 
good pairs of QPPs for use in the 3D-TC are compared to 
a probabilistic lower bound on the dmin when selecting codes 
from the 3D-TC ensemble uniformly at random. This comparison 
shows that the use of designed QPP interleavers can improve the 
dmin significantly. For instance, we have found a (6144, 2040) 
3D-TC with an estimated dmin of 147, while the probabilistic 
lower bound is 69. Higher rates are obtained by puncturing 
nonsystematic bits, and optimized periodic puncturing patterns 
for rates 1/2, 2/3, and 4/5 are found by computer search. 
Finally, we give iterative decoding thresholds, computed from 
an extrinsic information transfer chart analysis, and present 
simulation results on the additive white Gaussian noise channel 
to compare the error rate performance to that of conventional 
turbo codes. 

Index Terms — Asymptotic minimum distance analysis, distance 
bounds, EXIT charts, hybrid concatenated codes, QPP inter- 
leavers, spectral shape function, turbo codes, uniform interleaver. 



I. Introduction 

Turbo codes have gained considerable attention since their 
introduction by Berrou et al. in 1993 [1] due to their near- 
capacity performance and low decoding complexity. The con- 
ventional turbo code is a parallel concatenation of two identi- 
cal recursive systematic convolutional encoders separated by a 
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pseudo-random interleaver Most conventional turbo codes us- 
ing 8-state constituent encoders suffer from a flattening around 
a frame error rate (FER) of 10^'^ due to a poor minimum 
distance dmin- To improve the performance in the error floor 
region, one could either design a better interleaver, use more 
powerful constituent encoders, or increase the dimension, i.e., 
the number of constituent encoders. The latter alternative was 
recently pursued in [2,3], where a powerful coding scheme 
nicknamed 3-dimensional turbo code (3D-TC) and inspired 
by the proposals in [4,5], was introduced. The coding scheme 
consists of a conventional turbo encoder and a patch. In more 
detail, a fraction A of the parity bits from the turbo encoder 
are post-encoded by a third rate-1 encoder The value of A 
can be used to trade-off performance in the waterfall region 
with performance in the error floor region. As shown in [2, 3], 
this coding scheme is able to provide very low error rates for 
a wide range of block lengths and code rates at the expense 
of a small increase in decoding complexity with respect to 
conventional turbo codes. 

It is known that conventional turbo codes and single serially 
concatenated convolutional codes are asymptotically bad, in 
the sense that their typical dmin asymptotically does not 
grow linearly with the block length [6,7]. As an alternative, 
multiple serially concatenated codes, such as repeat multiple- 
accumulate (RMA) codes, can be used, since they yield a 
better dmin- In [8], it was shown that there exists a sequence of 
RMA codes with minimum distance converging in the limit of 
infinitely many accumulators to the Gilbert- Varshamov bound 
(GVB). The stronger result that the typical (i,„in converges 
to the GVB was recently proved in [9]. Also, in [10], it 
was conjectured by Pfister that the d,„iii of RMA codes 
asymptotically grows linearly with the block length, and that 
the growth rate is given by the threshold where the asymptotic 
spectral shape function [11] becomes positive. Kliewer et al. 
and Ravazzi and Fagnani showed independently in [9, 12] that 
RMA code ensembles with two or more accumulators are 
indeed asymptotically good (their typical dmin asymptotically 
grows linearly with the block length). A formal proof was 
given in [9], and a method for the calculation of a lower bound 
on the asymptotic growth rate coefficient was given in [12]. 
The analysis was later extended in [13, 14] to low -rate hybrid 
concatenated codes, i.e., mixed parallel and serial structures 
combining the features of the two concatenations. 

In the first part of this paper, following these works, we ana- 
lyze the minimum distance properties of 3D-TCs. We perform 
an asymptotic dmin analysis of 3D-TCs by using a numerical 
procedure to estimate their asymptotic spectral shape function. 
The numerical procedure is based on the approach in [15] 
to compute asymptotic input-output weight distributions of 
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convolutional encoders. It is shown numerically that for certain 
parameters, the dmin of 3D-TCs asymptotically grows Hnearly 
with the block length. We also perform a finite-length analysis 
of the dmin of 3D-TCs using a probabilistic lower bound, 
outUned in [8], on the dmin- 

Interleavers for conventional turbo codes have been exten- 
sively investigated. The dithered relative prime (DRP) inter- 
leavers [16, 17] and the almost regular permutation (ARP) in- 
terleavers [18] are considered among the best ones, since they 
provide relatively high dmin- Recently, Sun and Takeshita [19] 
suggested the use of permutation polynomial (PP) based inter- 
leavers over integer rings. In particular, quadratic polynomials 
were emphasized. In contrast to DRP and ARP interleavers, PP 
interleavers are fully algebraic, allowing a theoretical analysis 
of their performance. In [20], the dmin of conventional binary 
turbo codes with quadratic permutation polynomial (QPP) 
interleavers was considered in detail, and large tables of 
optimum (in terms of dmin and its corresponding multiplicity) 
QPPs for conventional turbo codes with 8-state and 16-state 
constituent encoders were presented. In the most recent work 
[21], Takeshita considered the use of higher degree PPs with 
great success. 

A suitable property for designed interleavers is the 
contention- free property, i.e., to avoid memory contentions in 
parallelized decoding 1 18, 22, 23]. While ARP and some mod- 
ified DRP interleavers [22] are contention-free, they are not 
maximum contention-free, i.e., every factor of the interleaver 
length is not a possible degree of parallel processing of the 
decoder. On the other hand, in [24] it was shown that all PPs 
generate maximum contention-free interleavers. Thus, these 
interleavers are very interesting from an implementation point 
of view. Furthermore, QPP interleavers are almost as good as 
DRP interleavers for a large number of short-to-medium block 
lengths in terms of decoding convergence and performance in 
the error floor region [19-21,24]. 

In the second part of this work, we analyze minimum dis- 
tance properties of 3D-TCs with dedicated QPP interleavers. 
In particular, we present several upper bounds on the dmin of 
binary 3D-TCs when using QPP interleavers with a quadratic 
inverse that do not depend on the permutation and the encoder 
in the patch, as long as the encoder maps the all-zero sequence 
to the all-zero sequence. Furthermore, we present some results 
from a random search for good pairs of QPPs for use in the 
binary 3D-TC. It is shown that the use of designed QPPs 
yields a very high dmin, improving significantly compared to 
the probabilistic lower bound on the dmin- 

The remainder of this paper is organized as follows. The 
encoder structure and design guidelines for 3D-TCs are de- 
scribed in Section II. Section III describes a probabilistic lower 
bound on the dmin of a code ensemble and its apphcation to 
3D-TCs. The asymptotic spectral shape function is introduced 
in Section IV along with a numerical procedure to estimate it. 
Furthermore, we show numerically that the 3D-TC ensemble 
is asymptotically good for certain parameters. The minimum 
distance properties of 3D-TCs with dedicated QPP interleavers 
are addressed in Sections V and VI. In particular. Section V 
describes QPPs and some of their properties, and in Sec- 
tion VI, an upper bound on the dmin with QPP interleavers 
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Fig. 1 . 3D turbo encoder. A fraction A of the parity bits from both constituent 
encoders Ca and ( are grouped by a parallel/serial multiplexer, permuted 
by interleaver He, and encoded by the rate-1 post-encoder C^. 

with a quadratic inverse for a conventional turbo code is 
presented, along with upper bounds on the dmin of 3D-TCs 
with QPP interleavers with a quadratic inverse and with any 
patch. Convergence properties are studied in Section VII, 
where an extrinsic information transfer (EXIT) chart analysis 
is performed. In Section VIII, the results of a random search 
for good pairs of QPPs for use in the 3D-TC are presented 
and compared to the finite-length results from Section III. 
Also, optimized puncturing patterns for rates 1/2, 2/3, and 
4/5 are established by computer search. Finally, simulation 
results on the additive white Gaussian noise (AWGN) channel 
are presented to compare the error rate performance to that 
of conventional turbo codes. Conclusions and a discussion of 
future work are given in Section IX. 

II. Coding Scheme 

A block diagram of the 3D-TC is depicted in Fig. 1. The 
information data sequence u of length K bits is encoded 
by a binary conventional turbo encoder. By a conventional 
turbo encoder we mean the parallel concatenation of two 
identical rate-1 recursive convolutional encoders, denoted by 
Ca and Cb, respectively. Here Ca and Cb are 8-state recursive 
convolutional encoders with generator polynomial g{D) = 
[l + D + D^)/{1 + -D^ + D^), i.e., the 8-state constituent 
encoder specified in the 3GPP/UMTS standard [25]. The 
code sequences of Ca and Cb are denoted by Xa and Xb, 
respectively. We also denote by x^^ the codeword obtained 
by alternating bits from Xa and Xb. A fraction A (0 < A < 1), 
called the permeability rate, of the parity bits from x'^*-' are 
permuted by interleaver He (of length = 2\K), and 
encoded by an encoder of unity rate Cc, called the patch 
or the post-encoder [3]. This can be properly represented 
by a puncturing pattem p applied to x"'"*-' (see Fig. 1) of 
period A^^p containing XNp ones (where a one means that 
the bit is not punctured). The fraction 1 — A of parity bits 
which are not encoded by Cc is sent directly to the channel. 
Equivalently, this can be represented by a puncturing pattern 
p, the complement of p. We denote by Xc the code sequence 
of Cc. Also, we denote by xf^ and the sub-codewords of 
Xa and Xb, respectively, sent directly to the channel, and by 
x'^'^ the codeword obtained by alternating bits from x^^ and 
x^. Likewise, we denote by xP and x^ the sub-codewords 
of Xa and Xb, respectively, encoded by Cc, and by xP the 
codeword obtained by alternating bits from x^ and x^. Finally, 
the information sequence and the code sequences x'^'^ and Xc 



3 



are multiplexed to form the code sequence x, of length N bits, 
transmitted to the channel. Note that the overall nonoinal code 
rate of the 3D-TC is R = K/N = 1/3, the same as for the 
conventional turbo code without the patch. Higher code rates 
can be obtained either by puncturing x'^'^ or by puncturing 
the output of the patch, Xc. In this paper, we consider the 
following puncturing strategy. First, puncture x'^'^. Then, if 
further puncturing is required, puncture Xc. 

In [3], regular puncturing pattems of period 2/ A were 
considered for p. For instance, if A = 1/4, every fourth 
bit from each of the encoders of the outer turbo code are 
encoded by encoder Cc. The remaining bits are sent directly 
to the channel, and it follows that p = [11000000] and 
p = [00111111]. In this paper, we consider both regular and 
random pattems for p. 

The 3D-TC can be decoded using the turbo principle. The 
decoder consists of three soft-input soft-output decoders C~^, 
C^^, and corresponding to the three constituent encoders 
Ca, Cb, and Cc, respectively. A decoding iteration consists 
of a single activation of C~^, C~^, and C^^, in this order. 
This process continues iteratively until the maximum number 
of iterations is reached or an early stopping rule criterion is 
fulfilled. 

A. Design Guidelines 

In [3], some guidelines for choosing the permeability rate 
A were given. In general, choosing a large value for A will 
increase the minimum distance. However, the performance in 
the waterfall region will degrade with increasing values of A. 
Thus, there is trade-off between performance in the waterfall 
and error floor regions. 

According to [3], the choice of the post-encoder is crucial 
for the code performance. In general, the post-decoder must 
be simple and be able to handle soft-input and produce soft- 
output information. Furthermore, the post-decoder must not 
exhibit too much error amplification (see [3] for details), since 
this will result in a high loss in convergence. In this paper, 
we consider the encoder with generator polynomial g{D) = 
l/{l + D^)forCc [3]. 

III. Weight Enumerators and Finite-Length 
Minimum Distance Analysis 

In this section, we analyze the minimum distance properties 
of 3D-TCs. In particular, we consider the ensemble of codes 
in the form of Fig. 1 obtained by considering all possible 
permutations for 11 and He through the uniform interleaver 
approach [26]. 

Let A'^ 1^ denote the input-output weight enumerator 
(lOWE) of a code C, i.e., the number of code sequences 
of weight h corresponding to input sequences of weight w. 
Also, let A^ = ^1^, Af^, 1^ be the weight enumerator (WE) of 
the code C, i.e., the number of code sequences of weight h. 
Denote by 5a and the weights of code sequences Xa and Xb, 
respectively, and by q its sum, q = qa + qh (i-C-, the weight 
of x^*-^). We also denote by n^, rib, and n the weights of 
code sequences xP, x^, and xP, respectively (n = ria + rib). 
Likewise, we denote by ma, mb, and m the weights of code 



sequences x!;", x^", and x™, respectively (m = ma + mb). 
Note that q = n + m. 

A. lOWE of 3D-TCs With Random Puncturing Pattern p 

We assume a random puncturing pattern for p. The punc- 
turing pattems are sampled uniformly at random among all 
those with [dN\ ones, where 6 is the fraction of bits that 
survive after puncturing. The average lOWE of the ensemble 
of punctured codes Cpunct, with input weight w and output 
weight h' is given by [12] 

N (h\/ N-h \ 

jCpunct _ Sr^ aC \h'}\lSN}-h'J 
^w,h' — Z^^rvM / N \ ^'^) 

h=h' \ysN\) 

where h is the output weight before puncturing. 

Using the concept of uniform interleaver [26] and (1) the 
ensemble-average lOWE of the 3D-TC ensemble, denoted by 
C, can be computed as 

AC.^ aCi. (q\(2K-q\ aCc 

tC _ \ ^ ^m,ija ttiig-ga \n/ \2\K-n! n,h-w-q+n 

q,q^,n \w) \1\K) \ n J 

(2) 

B. lOWE of 3D-TCs With Regular Puncturing Pattern p 

Here, we assume the use of regular (i.e., nonrandom) 
puncturing pattems for p. In this case, the ensemble-average 
lOWE of the 3D-TC ensemble can be written as 

TC _ «',(ma,na) «),(m— rna,n— ria) n,h—w—m 

^w,h — 2-^ 7k\ ■ 72\K\ 

m,m^,n,ni^ \w/ \ n / 

(3) 

where A'^'', x = a, b, is the number of codewords 

of constituent encoder Cx with input weight w, and output 
weights TOx and rix corresponding to the sub-codewords sent 
directly to the channel and to encoder Cc, respectively. We 
remark that the two enumerators in (2) and (3) are not the 
same, since the first is averaged over puncturing pattems, while 
the second is for a given puncturing pattern. Thus, there is a 
sUght abuse of notation. 

C. Finite-Length Minimum Distance Analysis 

The ensemble-average WE can be used to bound the 
minimum distance rfmin of the code ensemble C in the 
finite-length regime. In particular, the probability that a code 
randomly chosen from the ensemble has minimum distance 
c^min < d is uppcr-boundcd by [8] 

d-l 

Pr(d^i„<d)<^A^. (4) 

h=l 

The upper bound in (4) can be used to obtain a probabilistic 
lower bound on the minimum distance of a code ensemble. For 
a fixed value of e, where e is any positive value between 
and 1, we define the probabilistic lower bound with probability 
e, denoted by (imin,LB,e> to be the largest integer d such that 
the right-hand side of (4) is at most e. This guarantees that 
Pr(rfmin >d)>l-e. 
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Fig. 2. Probabilistic lower bound on the minimum distance of 3D-TC 
ensembles, R = 1/3, for several values of A using regular and random 
puncturing patterns, when e = 0.5. 
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Fig. 3. Probabilistic lower bound on the minimum distance of 3D-TC 
ensembles, R = 1/2, for several values of A using regular puncturing patterns, 
when e = 0.1 and 0.5. 



In Fig. 2, we plot the probabilistic lower bound from (4), 
<^min,LB,o.5> for 3D-TC enscmblcs, R = 1/3, with regular 
puncturing patterns p and for several values of A as a function 
of the block length N . We assumed c = 0.5, which implies 
that Pr((imin > d) > 0.5, i.e., we expect that at least half 
of the codes in C have a dmin at least equal to the value 
predicted by the curves. For comparison purposes, the GVB 
is also displayed in the figure. For values of A down to 1/4, 
the growth rate of the dmin appears to be linear with the 
block length, while for smaller values of A little growth is 
observed. The achievable minimum distances are quite high, 
especially for high values of A. For instance, the bound for 
the 3D-TC ensemble with A = 1 is daVm ~ 215 for a 
block length of = 1950 bits. Reducing A, significantly 
reduces the growth rate. For instance, for A = 1/2, the 
probabilistic lower bound gives dmin ~ 92 for the same block 
length. For higher rates, the curves get closer to the GVB. In 
Fig. 2, we also plot the probabilistic lower bound from (4) 
for i? = 1/3 3D-TC ensembles with A = 1/2 and 1/4 using 
random puncturing patterns p. Compared to regular puncturing 
patterns, the growth rate of the dmin is slightly smaller. Thus, 
regular puncturing patterns seem to be a good choice for p. 

In Fig. 3, we plot the probabilistic lower bound from (4) for 
i? = 1/2 3D-TC ensembles using regular puncturing patterns 
p for several values of A and the code block lengths used for 



the QPP interleavers in Section VIII. In the figure, we plot the 
bound for e = 0.5 and 0.1, i.e., 50% and 90% of the codes 
have minimum distance at least equal to the value predicted 
by the curves, respectively. In all cases, a linear-like growth 
rate is observed. 



IV. Asymptotic Minimum Distance Analysis 

In this section, we analyze the asymptotic behavior of the 
WE of 3D-TC ensembles to show that their typical rfmin grows 
linearly with the block length for some parameters. To this 
end, we consider the behavior of the asymptotic spectral shape 
function of the code ensemble, defined as [11] 



1 — c 
r(p) =limsup — In^i^j 

N — >-oo J-^ 



(5) 



where sup( ) denotes the supremum of its argument, p = ^ 
is the normalized output weight, and TV is the block length. 

From (5), we can write A^^ ~ Q^rip) ^jjgjj — y 
Therefore, if there exists some abscissa > such that 
supp<p. r{p) < Vp* < po, and r{p) > for some p > po, 
then it can be shown, with high probability, that the dmin 
of most codes in the ensemble grows Unearly with the block 
length N, with growth rate coefficient of at least po [9, 12]. On 
the other hand, if r(p) is strictly zero in the range (0, po), then 
we cannot conclude directly whether d^i^ grows linearly with 
block length or not. In [9], it was shown that the asymptotic 
spectral shape function of RMA codes exhibits this behavior, 
i.e., it is zero in the range (0, po) and positive for some p > po. 
However, by combining the asymptotic spectral shapes with 
the use of bounding techniques, Ravazzi and Fagnani were 
able to prove in [9, Theorem 6] that the minimum distance of 
RMA codes indeed grows linearly with the block length with 
growth rate coefficient of at least po. 

We remark that in the rest of the paper, with a slight 
abuse of language, we sometimes refer to po as the exact 
value of the asymptotic growth rate coefficient. However, we 
emphasize that, strictly speaking, po is only a lower bound on 
the asymptotic growth rate coefficient. 



A. Asymptotic Spectral Shape Function of 3D-TCs 

For analysis purposes, we assume a random puncturing 
pattern p. Let C be an {N, K) code. We define the asymptotic 
behavior of the lOWE for C as the function 



'(a,/3) =limsup — lnAL„xj,L;8ivj (6) 

N — yoo 



where a = w/K and /3 = /i/A^ are the normalized input 
weight and the normalized output weight, respectively. Using 
(2), (6), and Stirling's approximation for binomial coefficients 
(fc) ~ e""'*^/") for n cx) and k/n constant, where H(-) 
is the binary entropy function with natural logarithms, the 
asymptotic spectral shape function of the (unpunctured) 3D- 
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Fig. 4. Asymptotic spectral sliape function for 3D-TCs, A = 1, and several 
code rates. 



TC ensemble C can be written as 



(p) = \ sup ia'^'^i'^, ta) + a^'-iuj, 2i - t^) 



(7) 



-H(a;) -2H(A) -2AH(/x)} 



where oj = w/K, ta = Qa/K, i = q/^K, and /x = n/Nc = 
n/2XK. For higher rates, i.e., when puncturing is appHed to 
x'^^ or to Xc, a similar expression is obtained with some extra 
terms. Note that in (7) we did not include the constraints on the 
relationship of the variables involved in the optimization. The 
constraints on the involved variables can be derived by looking 
at the arguments of the involved binary entropy functions in 
(7). In general, the argument of the binary entropy function 
should be between and 1. 

The asymptotic lOWE of the iimer code Cc is equal to the 
asymptotic lOWE of an accumulate code [15] and can be given 
in closed form as 



a^-{a,l3) = (1-/3)H 



2(1-^) 



On the other hand, a closed-form expression for the asymptotic 
lOWE of the 8-state convolutional encoder is not known. 
However, in [15], Sason et al. provided a numerical method for 
computing the asymptotic lOWE of convolutional encoders. 
Here, we use the numerical procedure proposed in [15] to 
compute the asymptotic lOWEs a'-^^(w, ta) and a"-^i'(w, 2t — tg,) 
in (7) to numerically compute the asymptotic spectral shape 
function. 

The numerical evaluation of the asymptotic spectral shape 
function is displayed in Fig. 4 for 3D-TCs, A = 1, and several 
code rates, and in Fig. 5 for i? = 1/3 3D-TCs and several 
values of A. The behavior of the asymptotic spectral shape 
function is similar to the one of RMA codes [9], i.e., it is zero 
in the range (0, po) and strictly positive for some p > pq. 
Note that since closed-form expressions for 0*^^(0;, ia) and 
a^^{L0,2L — La) are not known, we cannot provide a formal 
proof that 3D-TCs are asymptotically good. However, the 
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Fig. 5. Asymptotic spectral shape function for R = 1/3 3D-TCs and several 
values of \. 

TABLE I 

Estimated growth rate coefficient po of 3D-TCs. 







R = 1/3 


R = 1/2 


R = 2/3 


R = 4/5 


A = 


1 


0.102 


0.077 


0.052 


0.030 


A = 


1/2 


0.029 


0.031 


0.023 


0.015 


A = 


1/3 


0.008 


0.010 


0.008 


0.006 


A = 


1/4 


0.001 


0.002 


0.001 


0.001 



numerical evaluation of the asymptotic spectral shape function, 
together with extensive numerical experiments that show that 
Pr((i,njn < [po-^'^J) — > as N gets large, suggest that 3D- 
TCs for the parameters in Figs. 4 and 5 (for A down to 1 /4) 
are indeed asymptotically good. Furthermore, the results are in 
agreement with the finite-length analysis. On the other hand, 
for A = 1/8, the asymptotic spectral shape function is strictly 
positive, meaning that the dmin does not grow linearly with 
the block length. 

The estimated growth rate coefficient po of 3D-TCs is 
reported in Table I for several code rates and values of the 
parameter A. As expected, for a fixed code rate, the growth 
rate coefficient increases as A increases. 

Notice that the growth rate coefficient for R = 1 /2 is higher 
than for i? = 1/3 for all values of A, except for A = 1. An 
heuristic explanation is that for A = 1 /2 and lower, to achieve 
R = 1/2, only parity bits sent directly to the channel are 
punctured. The minimum weight of the parity bits sent directly 
to channel will not grow linearly with the block length, since 
this would imply that the minimum distance of a conventional 
turbo code would grow linearly with the block length, which is 
not the case [6]. Thus, we would expect that there is not much 
difference in the growth rate between i? = 1/3 and R — 1/2 
for A < 1/2, and the values for J? = 1/2 can even be higher, 
since the rate is higher. 

B. Threshold Under Maximum- Likelihood (ML) Decoding 

The asymptotic spectral shape function of a code ensemble 
can also be used to derive a threshold under ML decoding. 
An upper bound on the ML decoding threshold of a code 
ensemble, due to Divsalar [27], is given by 



1 



< -- • max 

R o<p<i 



(l_e-2'-(p))(i_p) 



2p 



(8) 



. -^0 / ML.threshold 

where R is the code rate, r{p) is the asymptotic spectral 
shape function, Eh/No denotes the signal-to-noise ratio, and 
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TABLE II 

Upper bounds on the ML decoding threshold of 3D-TCs based 

ON DiVSALAR'S bound IN [27]. 





R = 1/3 


R = 1/2 


R = 2/3 


R = 4/5 


Capacity 


-0.495 dB 


0.187 dB 


1.059 dB 


2.040 dB 


A = 1 


-0.440 dB 


0.319 dB 


1.325 dB 


2.475 dB 


A = 1/2 


-0.352 dB 


0.400 dB 


1.358 dB 


2.485 dB 


A = 1/3 


-0.272 dB 


0.514 dB 


1.452 dB 


2.509 dB 


A = 1/4 


-0.211 dB 


0.605 dB 


1.668 dB 


2.553 dB 



(i?6/A''o)ML, threshold IS the ML decoding threshold. We com- 
puted the upper bound on the ML decoding threshold in (8) 
numerically for 3D-TC ensembles for several values of A and 
code rates. The results are given in Table II. For comparison 
purposes, we also report in the table the binary-input AWGN 
Shannon limit. For A = 1 and R = 1/3, the ML decoding 
threshold is very close to the capacity limit, while the gap 
to capacity generally increases with the code rate. Also, note 
that similarly to the growth rate coefficient, the upper bounds 
on the ML decoding threshold decrease as the value of A 
increases. In Section VII, we will compare these upper bounds 
on the ML decoding threshold to iterative decoding thresholds 
computed from an EXIT chart analysis. 

V. QPPs OVER Integer Rings 

In the previous sections we analyzed the minimum distance 
properties of 3D-TC ensembles (generated by varying 11 and 
He over all possible permutations) and showed numerically 
that their dmin grows linearly with block length for certain 
values of A. In the following, we consider the minimum 
distance properties of 3D-TCs with designed interleavers. In 
particular, we consider QPP interleavers. In this section, we 
establish notation and restate the criterion for existence of 
QPPs over integer rings. The interested reader is referred 
to [19,28] for further details. 

Definition 1: Given an integer M > 2, a polynomial 
f{x) = fix + f2x'^ (mod M), where fi and /2 are non- 
negative integers, is said to be a QPP over the ring of integers 
Zm when f{x) permutes {0, 1, 2, . . . , M — 1}. 

In this paper, let the set of primes he V = {2, 3, 5, 7, . . .}. 
Then an integer M can be factored as M = Ylp^-pP^"'", 
where nM,p > 1 for a finite number of p's and riM.p = 
otherwise. For example, if M = 3888 = 2** x 3'' we have 
^3888,2 = 4 and n3888,3 = 5. For a quadratic polynomial 
f{x) = fix + f^x"^ (mod M), we will abuse the previous 
notation by writing /2 = HpepP"*^''' i-^-' exponents of 
the prime factors of /2 will be written as nF,p instead of 
the more cumbersome n/2,p because we are interested in the 
factorization of /2. 

Let us denote a divides h by a\b and hy a \ b otherwise. 
The greatest common divisor of a and h is denoted by 
gcd(a, h) and the least common multiple of a and h is denoted 
by lcm(a, 6). The necessary and sufficient condition for a 
quadratic polynomial f{x) to be a PP is given below. 

Proposition 1 ([19,28]): Let M = Upev P""''" ■ ^he nec- 
essary and sufficient condition for a quadratic polynomial 
f{x) = fix + f2x'^ (mod M) to be a PP can be divided 
into two cases. 



1) Either 2 f M or 4|M (i.e., nM,2 7^ 1) 
gcd(/i,M) = 1 and /2 = ripepf'"^"''^^.P ^ ^' 
such that nM,p > 1. 

2) 2|M and 4 t M (i.e., nM.2 = 1) 

/i + /2 is odd, gcd(/i,f) = 1, and /2 = 
npe7'P"^")^f,p ^ 1' '^P such that p 2 and 
nM,p > 1. 

For example, if M = 256, then we determine from case 1) 
of Proposition 1 that /i € {1,3,5,..., 255} (set of numbers 
relatively prime to M) and /2 € {2, 4, 6, . . . , 254} (set of 
numbers that contain 2 as a factor). This gives us 128 x 127 = 
16256 possible pairs of coefficients /i and /2 that make f{x) 
a PP. 

Finally, we remark that some QPPs have a quadratic inverse, 
i.e., the inverse permutation can also be generated by a QPP. 
We will not state the exact conditions here, but refer the 
interested reader to [28] for further details. 



A. Quasi-Cyclic Property of 3D-TCs Using QPPs 

Assume tailbiting termination of the upper and lower con- 
stituent encoders and of the encoder in the patch of the 3D-TC. 
Furthermore, 1 /A is assumed to be a divisor of K, the length 
of the interleaver 11, and the puncturing pattem p is assumed 
to be regular, i.e., p = [1100 • • • 00]. 

Lemma 1: The 3D-TC is quasi-cyclic with period p, where 

p = l- \cm{K/ gcd(2/2, K), 1/A, K/ gcd(2/2, N,)), 

f{x) = fix + /2x2 (mod if) and /(.x) = fix + f2x'^ 
(mod N(^) generate the turbo code interleaver and the per- 
mutation in the patch, respectively, and Nc is the input length 
to the patch, as defined in Section II. Furthermore, / is the 
smallest positive integer solution to the quadratic congruence 

2Ap((/i - 1)Z + /2p/2) = (modiVc) (9) 

where p = p/l. 

Proof: Let u denote an input sequence to a 3D-TC, let i, 
<i < K, denote an arbitrary position in u, and let ~^^ denote 
a quasi-cyclic shift of period p of u. Now, the position i+p 
(mod K) in 1^ is interleaved to f{i+p) = f{i) + f{p)+2f2ip 
(mod K). Furthermore, to make the difference between the 
interleaved positions f{i + p) and f{i) independent of i, or 
equivalently, XbCit) a quasi-cyclic shift of Xb(u), we require 
that 2f2P = (mod K), i.e., p = I - K/ gcd{2f2, K), for 
some positive integer I. Also, to make xP( i?) a quasi-cyclic 
shift of xP(u), both 2A(/(p) - p) = (mod N^) (which 
gives (9)) and {1/X)\p must hold. Finally, to make Xc(lt) 
a quasi-cyclic shift of Xc(u), 2Xp needs to be a multiple of 
Nc/ gcd(2/2, Nc), or equivalently, p needs to be a multiple of 
K/ gcd(2/2, Nc), since xP(li') is a quasi-cyclic shift of xP(u) 
of period 2Xp, and the result follows. ■ 
Without the patch, the conventional binary turbo code is 
quasi-cyclic with period if/ gcd(2/2, if ). We remark that a 
similar result will hold for 3D double-binary turbo codes [3] 
with symbol interleaving based on QPPs. 
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VI. Upper bounds on dmhi with QPPs with a 

QUADRATIC INVERSE 

In this section, we present upper bounds on dmin with QPPs 
with a quadratic inverse. 

First, we state a general result from [29] on the minimum 
distance of a conventional binary turbo code with QPP in- 
terleavers. We assume tailbiting termination of the upper and 
lower constituent encoders. 

Theorem 1 ([29]): The minimum distance of a conven- 
tional binary turbo code (of nominal rate 1/3) using primitive 
feedback and monic feedforward polynomials of degree 
and QPPs with a quadratic inverse, is upper-bounded by 
2(2"^+! + 9). 

We remark that Theorem 1 applies for all interleaver lengths 
K and is achievable for a range of i^-values [29]. The bound 
of Theorem 1 is due to an input-weight 6 codeword containing 
3 input-weight 2 fundamental paths, or error events, in both the 
upper and lower constituent codewords. Also, the upper bound 
of Theorem 1 can be shown to hold with dual termination 
[30] as well, i.e., the upper and lower constituent encoders are 
forced to begin and end in the zero state, when K > 2''+'^ — 7 
[29]. 

A. Upper Bounds on dmin With QPPs With a Quadratic 
Inverse for the 3D-TC With Any Patch 

In this subsection, we consider the binary 3D-TC of Fig. 1 
with any given patch and such that 4| (1/A) and with regular 
puncturing pattern p = [1100 • 00]. Note that it may be 
possible to derive bounds for other values of A using a similar 
procedure as the one outlined below. Also, we remark that the 
bounds here can be generalized to 3D-TCs with lower and 
upper constituent encoders other than the 8-state constituent 
encoder with feedforward polynomial 1+D+D^ and feedback 
polynomial 1 -|- + . However, in this work, we will 
constrain the analysis to this 8-state encoder and to the case 
when 4|(1/A)|i^. On the other hand, we do not consider a 
specific encoder for C^. In fact, in the analysis below, the 
only condition we require is that the encoder of the patch 
maps the all-zero sequence to the all-zero sequence. Also, 
we assume tailbiting termination of the upper and lower 
constituent encoders. Note that the termination method of the 
encoder in the patch is not an issue here. 

The upper bounds in this section are based on certain critical 
codewords that always occur for some specific lengths for any 
QPP interleaver (with a quadratic inverse) for the conventional 
binary turbo code. To find these critical codewords, we have 
used the following strategy. 

• First select a particular interleaver length K and perform 
a random search for good pairs of QPPs using the triple 
impulse method [17] to estimate the (imin of the 3D-TC. 

• Low-weight codewords identified by the triple impulse 
method are added to a list of codewords. Within this list 
of codewords, there are often codewords that give an all- 
zero input sequence into the patch, and it is sometimes 
possible to identify among these codewords certain types 
of codewords that occur repeatedly. 



This is how we have found the critical codewords depicted in 
Figs. 10, 11, and 12 in Appendices A and B. These critical 
codewords give the following two theorems, where f{x) = 
/ix + /2X^ (mod K) is the QPP that generates the interleaver 
of the outer turbo code and g{x) = gix + g2X^ (mod K) is 
its inverse. 

Theorem 2: The minimum distance of a binary 3D-TC with 
feedforward polynomial 1 + D + and feedback polynomial 
1 + D"^ + for the upper and lower constituent encoders 
and with QPP interleavers with a quadratic inverse, is upper- 
bounded by 67 when 1) the interleaver length K satisfies the 
conditions ^ 

7, ifp = 2 
1 . otherwise, 



nK,p < 



(10) 



2) the encoder in the patch maps the all-zero sequence to the 
all-zero sequence, and 3) 4|(1/A)|_ft'. 

Proof: See Appendix A. ■ 

For example, K = 1504 and 640 satisfy the inequality in 
(10), since 1504 = 2^ x 47 and 640 = 2^ x 5. 

Theorem 3: The minimum distance of a binary 3D-TC with 
feedforward polynomial 1 + D + D^ and feedback polynomial 
1 + D'^ + for the upper and lower constituent encoders 
and with QPP interleavers with a quadratic inverse, is upper- 
bounded by 27 (resp. 54) when 1) 2g2 = (mod K) (resp. 
4^2 = (mod K)), 2) the encoder in the patch maps the 
all-zero sequence to the all-zero sequence, and 3) 4|(1/A)|ii'. 
Proof: See Appendix B. ■ 

We remark that Theorem 3 can be formulated with the 
conditions 2/2 = (mod K) and 4/2 = (mod K) instead 
of the conditions 2g2 = (mod K) and 4£(2 = (mod K) 
due to symmetry. Note that the condition 2^2 = (mod K) 
is equivalent to the fact that the QPP g{x) = gix + g2X^ 
(mod K) is indeed a linear PP, i.e., the same permutation is 
generated by a linear PP. 

Lemma 2: If K > 32, then the upper bound on the rfmin of 
67 in Theorem 2 holds with dual termination as well. 

Proof: See Appendix C. ■ 

We remark that a lower bound on K can be derived, in 
a similar fashion, to make the upper bounds on the dmin in 
Theorem 3 hold with dual termination as well. We omit the 
details for brevity. 

Finally, note that in principle it is possible to derive bounds 
for other values of A and when puncturing is applied using the 
procedure above, both for binary and double-binary codes. 

VII. EXIT CHART ANALYSIS 

In this section, we estimate the convergence thresholds of 
3D-TCS through an EXIT chart analysis [31] on the AWGN 
channel using regular puncturing patterns p. Also, higher code 
rates are obtained by randomly puncturing nonsystematic bits 
according to the puncturing strategy in Section 11. 

The EXIT charts of two 3D-TCs, with parameters A = 1/2, 
R = 1/3, and A = 1/4, i? = 2/3, at an Eb/No of 7 = 0.55 
dB and 7 = 1.72 dB, respectively, are depicted in Fig. 6. The 
sohd curves are the EXIT curves of the outer turbo code, while 
the dashed curves are the EXIT curves of the inner encoder. In 
both cases a turmel between the two EXIT curves is observed. 
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-O- X=M2. R=1/3 
-□— X=1/4. R=2/3 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Ia(Cc); Ie(TC) 

Fig. 6. EXIT charts of A = 1/2, R = 1/3, 3D-TC (7 = 0.55 dB) and of 
A = 1/4, R = 2/3, 3D-TC (7 = 1.72 dB). The dashed curves are the EXIT 
curves of the inner encoder, while the sohd curves are the EXIT cimies of 
the outer turbo code. 

TABLE m 

Convergence Thresholds of 3D-TCs. 







R = 1/3 


R = 1/2 


R = 2/3 


R = 4/5 


A = 


1 


1.26 dB 


2.00 dB 


2.99 dB 


4.14 dB 


A = 


1/2 


0.52 dB 


1.24 dB 


2.23 dB 


3.39 dB 


A = 


1/3 


0.30 dB 


1.00 dB 


1.88 dB 


3.00 dB 


A = 


1/4 


0.20 dB 


0.85 dB 


1.68 dB 


2.79 dB 


A = 





-0.05 dB 


0.62 dB 


1.53 dB 


2.46 dB 



meaning that convergence is possible at this Eh/No. Note that 
both EXIT functions of the inner code and the outer turbo 
code depend on the Eb/No. For each pair of curves, a vertical 
step between the lower curve (i.e., the EXIT curve of the turbo 
code) and the upper curve (i.e., the EXIT curve of the inner 
code) represents a single activation of decoder C~^, while a 
horizontal step between the upper curve and the lower curve 
represents an unspecified number of activations of decoders 
and Cy^^ until nothing more can be gained. For A = 1/2, 
R = 1/3, the tunnel between the two curves opens at 7 = 0.52 
dB, predicting a convergence threshold around this value. For 
A = 1/4, R = 2/3, the tunnel opens at 7 = 1.68 dB. The 
convergence thresholds of 3D-TCs are given in Table III for 
several code rates and values of A. The convergence thresholds 
were computed assuming a block length K = 10^ bits'. For a 
given code rate, the best convergence threshold is achieved by 
the fully parallel concatenated code (A = 0), while increasing 
A leads to poorer thresholds. On the other hand, in terms 
of rfmin, the behavior is the opposite, i.e., larger minimum 
distances are obtained by increasing A. This suggests that there 
is a trade-off between iterative convergence threshold and dmin 
growth rate. 

'We remark that the thresholds in Table III are slightly different from the 
thresholds in Table II in [32]. This is due to the fact that the thresholds in 
[32] were computed using a shorter block length. 



It is interesting to note that the iterative decoding thresholds 
get worse for increasing values of A, while the upper bounds 
on the ML decoding threshold (see Table II) improve. This 
behavior can be explained from the fact that the latter depends 
on the code while the former depends also on the decoding 
algorithm. The sub-optimality of the iterative decoding algo- 
rithm is expected to be higher when the contribution of the 
patch is higher (i.e., for increasing values of A), which explains 
the results in Table HI compared to the upper bounds on the 
ML decoding threshold in Table II. 

VIII. Numerical results 

In this section, we present some numerical results from a 
random search for pairs of QPPs with A = 1/4 and with 
regular puncturing pattern p = [11000000], which give good 
estimated minimum distance for the binary rate- 1/3 3D-TC. 
As shown above, A = 1/4 is a good compromise between 
minimum distance and convergence threshold. To estimate 
the minimum distance, we used the triple impulse method 
[17]. Also, to speed up the search, we limited the search 
space through the conditions 2^/2 ^ (mod K), I — 2 and 
4, whenever appropriate. The rationale behind the condition 
I6/2 ^ (mod can be found in the proof of Theorem 2 
in Appendix A. In the search, all three constituent encoders 
were forced to begin and end in the zero state. The results, 
for some specific lengths and rates (high rates are obtained 
by puncturing nonsystematic bits, as explained in Section II), 
are tabulated in Table IV. For each code rate, the minimum 
distances of the 3D-TCs, estimated using the triple impulse 
method, are denoted by rfj,^),, where R is the code rate. For 
comparison purposes, we have also tabulated the probabilistic 
lower bound on the dmin from (4) with e = 0.5, denoted by 
'^min LB' usiug random puncturing patterns. 

For rate 1/3, the results are reported in column 7 in 
Table IV. The /C- values in bold indicate lengths where the 
upper bound in Theorem 2 appears to be tight. In the table, 
we also report (in column 8) the probabilistic lower bound 
on the dmin from (4), and the optimum dmin (in column 15) 
for a conventional rate- 1/3 binary turbo code with a QPP 

TO fl/3) 

interleaver, denoted by d^^^ .As expected, the designed 
QPP interleavers improve the dmin significantly with respect to 
the probabilistic lower bound. Moreover, the 3D-TCs achieve 
much higher minimum distances than the best conventional 
turbo codes. Therefore, they are expected to exhibit much 
lower error floors. 

The results for rates 1/2, 2/3, and 4/5 are tabulated in 
columns 9, 11, and 13 in Table IV, respectively. Specific 
periodic puncturing patterns have been found by computer 
search, following the puncturing strategy in Section II. In more 
detail, for rates less than 2/3, only bits from x'^^ are punctured. 
In particular, for rate R = 2/3, all bits from x'^'^ are punctured. 
For rates larger than 2/3, all bits from x'^'^ and some bits from 
Xc are punctured. 

In the computer search, both error rate performance in the 
error floor region (i.e., the minimum distance) and the waterfall 
region have been considered. Performance in the waterfall 
region is particularly important for rate 4/5, since some 
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TABLE IV 

Results from a random search for pairs of QPPs with A = 1 /4, both with a quadratic inverse, in which the first QPP f{x) 

GENERATES THE TURBO CODE INTERLEAVER AND THE SECOND QPP f{x) GENERATES THE PERMUTATION IN THE PATCH. HIGHER RATES ARE OBTAINED 
BY PUNCTURING THE NONSYSTEMATIC BITS USING OPTIMIZED (BY COMPUTER SEARCH) PERIODIC PUNCTURING PATTERNS. 





r, 
J I 


.12 


\V 


,/ i 


J 2. 


mill 


mill. LB 


mill 


iiiiii.LB 


,/'-■■■■" 

liiili 


ill ill, LB 


ill in 


liiin.LB 


jTC\(l/3) 
mill 


512 


175 


192 


256 


15 


192 


67 


33 


38 


19 


17 


11 


15 


7 


38 [20] 


640 


631 


40 


320 


21 


180 


67 


36 


41 


21 


17 


12 


16 


8 


39 [29] 


768 


613 


24 


384 


73 


216 


79 


39 


42 


24 


22 


14 


17 


9 


39 [29] 


1024 


465 


224 


512 


157 


160 


93 


45 


54 


28 


23 


17 


22 


11 


45 [20] 


1504 


299 


IS8 


752 


147 


282 


07 


57 


41 


36 


17 


^2 


16 


15 


50 |29] 


2048 


673 


448 


1024 


71 


192 


147 


69 


110 


45 


64 


29 


40 


19 


50 [29] 



puncturing patterns show very poor error rate performance. In 
fact, some puncturing pattems do not give a distinguishable 
waterfall region at all. This is the case for 54 of the 70 
puncturing pattems of period 8 for Xc . In more detail, for rate 
1/2, we have looked at all 15 ((2)) periodic puncturing pat- 
terns of period 6 for x^'^ and Xjj'^, and all their combinations, 
i.e., we have looked at 15 x 15 = 225 periodic puncturing 
patterns of period 12 for x'^'\ For rate 2/3, according to 
the puncturing strategy above, there is only one puncturing 
pattern to check, and finally, for rate 4/5, we have looked 
at all 70 ((4)) periodic puncturing patterns of period 8 for 
Xc. In the computer search, for each candidate puncturing 
pattern, an upper bound on the dmin was computed using 
the triple impulse method, which generated an ordered list 
(the candidates were ordered according to the computed upper 
bound on the rfmin) of puncturing pattems. Also, to speed up 
the search, candidates where rejected if a codeword of weight 
less than some running d-^a-a value was found. Finally, an 
estimate of the dmin of the best candidates in the ordered 
list, in the order of the list, was computed using a stronger 
version of the triple impulse method. By a stronger version 
of the triple impulse method we mean an impulse method 
where the impulse ranges have been extended quite a bit. The 
actual fine-tuning of the impulse ranges has been done based 
on experimental results and running time considerations. For 
rate 4/5, an initial filtering of the candidate puncturing pattems 
based on convergence can also be done. 

For comparison purposes, we report, in columns 10, 12, and 
14 in Table IV, the probabilistic lower bound (with e = 0.5) on 
the dmin from (4) using random puncturing pattems for rates 
1/2, 2/3, and 4/5, respectively. Note that the designed QPP 
interleavers improve the dmin with respect to the probabilistic 
lower bound, even for high rates, except for K = 1504 and 
rate 2/3. However, K = 1504 is indicated in bold in Table IV, 
which means that the upper bound in Theorem 2 applies. Thus, 
this is not a good value for K with QPP interleavers, and 
should therefore be avoided. 

A. Simulation Results 

In Fig. 7, we give FER curves on the AWGN channel 
of a nominal rate- 1/3 binary 3D-TC with K = 1024 and 
2048. The turbo code interleavers were generated by the QPPs 
f{x) = 465x + 224x'^ (mod 1024) and f{x) = 673a;-|-448a;2 




10 I ' 1 ' ' ^ ' ' 

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 

Eb/NoindB 



Fig. 7. Comparison of FER performance of iterative decoding of conventional 
turbo codes and binary ZD-TCs. The nominal code rate is 1 /3. 



(mod 2048), and the permutations in the patch were gen- 
erated by the QPPs f{x) = 157a; + leOa;^ (mod 512) and 
f{x) = 71x + 192x2 (mod 1024), for K = 1024 and 
2048, respectively (see Table IV). In the simulation, we used 
A = 1/4 and a maximum of 16 iterations. The actual code 
rate is {K — 2u — i/)/3K due to trelUs termination, where 
is the constraint length of the upper and lower constituent 
encoders and i> is the constraint length of the third encoder in 
the patch. Furthermore, to simpUfy the decoders, the max-log 
approximation with scaling of the extrinsic values was used 
within the Bahl-Cocke-Jehnek-Raviv (BCJR) algorithm. For 
comparison purposes, the performance of conventional binary 
turbo codes with input length K = 1024 and 2048, and with 
rate {K—2i')/3K (due to dual termination), is also given in the 
figure. The turbo code interleaver for K = 2048 was generated 
by the QPP f{x) = 21.x + USx"^ (mod 2048), which gives 
the optimum (the upper bound in Theorem 1 is achieved) 
minimum distance of 50 [29], and the same maximum number 
of decoding iterations was used. For K = 1024, the turbo code 
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Fig. 8. Comparison ofFER performance of iterative decoding of conventional Fig. 9. Comparison ofFER performance of iterative decoding of conventional 
turbo codes and binary 3D-TCs. The nominal code rate is 1/2. turbo codes and binary 3D-TCs. The nominal code rate is 4/5. 



interleaver was generated by the QPP f{x) = 245a; + MSx^ 
(mod 1024), which is a dmin-optimum QPP interleaver giving 
a dmin of 45 [29]. As can be seen from Fig. 7, there is a 
performance loss of about 0.3 dB in the waterfall region for the 
3D-TC compared to the conventional turbo code. This loss in 
performance is consistent with the EXIT chart analysis of the 
previous section (see Table III). However, for very low error 
rates, the performance of the 3D-TC will clearly be superior 
to the performance of the conventional turbo code, due to a 
much larger dmin- 

In Figs. 8 and 9, we give the FER performance of the 
3D-TCs from Fig. 7 with different puncturing pattems. For 
comparison purposes, the performance of the conventional 
binary turbo codes from Fig. 7 with different puncturing 
patterns is also plotted. The nominal code rate of the codes 
in Fig. 8 is 1/2, while the nominal code rate of the codes in 
Fig. 9 is 4/5. In both cases, optimized puncturing pattems (the 
puncturing pattems used to produce the entries in Table IV) 
were used in the simulations of the 3D-TCs. Thus, the 3D- 
TCs in Fig. 8 have estimated minimum distances of 54 and 
110 for K = 1024 and 2048, respectively. In Fig. 9, the 
3D-TCs have estimated minimum distances of 22 and 40 for 
K = 1024 and 2048, respectively. Also, for a fair comparison, 
the puncturing pattems for the conventional binary turbo codes 
were optimized as well, using the same interleavers as for 
the rate- 1/3 mother codes, which were optimized through a 
QPP interleaver search, giving the minimum distance values 
in column 15 of Table IV. In fact, the conventional binary 
turbo codes in Fig. 8 have minimum distances of 24 and 
25 for K = 1024 and 2048, respectively. In Fig. 9, the 
conventional binary turbo codes have minimum distances of 8 
and 9 for ii" = 1024 and 2048, respectively. From the figures, 
we observe no error floor down to a FER of about 10~^ for 



the 3D-TCs. Also, as expected, there is a loss in performance 
in the waterfall region with respect to conventional binary 
turbo codes. However, for low error rates, the performance 
of the 3D-TCs will clearly be superior to the performance 
of the conventional binary turbo codes for both rates. The 
conventional turbo code shows a flattening between a FER 
of 10-* - 10-5 and 10-^ - 10-* for i? = 1/2 and 4/5, 
respectively, while significantly lower error floors are expected 
for the 3D-TC due to its better minimum distance. 

IX. Conclusion and Future Work 

In this work, we presented a finite-length and an asymptotic 
minimum distance analysis of 3D-TCs with binary constituent 
encoders. By using a numerical procedure, recently proposed 
by Sason et ai, to compute the asymptotic lOWE of con- 
volutional encoders, we numerically evaluated the asymptotic 
spectral shape function of 3D-TCs and showed (numerically) 
that for certain parameters, the 3D-TC ensemble is asymp- 
totically good, in the sense that its typical minimum distance 
asymptotically grows linearly with the block length. The re- 
sults were supported by the finite-length analysis. Higher rates 
obtained through random puncturing were also considered. 
In the second part of the paper, designed QPP interleavers 
were considered. In particular, we derived some useful upper 
bounds on the dmin when using this type of interleaver with 
a quadratic inverse. A random search for pairs of QPPs for 
use in the 3D-TC was performed, and the best codes (in 
terms of estimated minimum distance) were compared to a 
probabilistic lower bound on the dmin- Higher rates were 
obtained through specific optimized (by computer search) 
periodic puncturing pattems. This comparison showed that the 
use of designed QPP interleavers can improve the minimum 
distance significantly. Finally, an EXIT chart analysis was 
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Fig. 10. An input-weight 16 critical codeword with QPP interleavers with 
a quadratic inverse for a conventional binary turbo code without the patch. 



conducted, iterative decoding thresholds and upper bounds 
on the ML decoding threshold were computed, and some 
simulation results were presented to compare the error rate 
performance of the 3D-TC to that of the conventional turbo 
code. 

An interesting topic for future work is to derive upper 
bounds on the dmin for 3D-TCs with QPP interleavers with a 
quadratic inverse for other values of the interleaver length K, 
i.e., for interleaver lengths that do not satisfy the constraints 
of Theorem 2, for instance, when K is a power of two. Also, 
as shown by Theorem 1, there is an upper bound on the dmin 
which is independent of the interleaver length K when using 
QPP interleavers with a quadratic inverse for a conventional 
binary turbo code. Can a similar bound be found for 3D-TCs? 

Appendix A 
Proof of Theorem 2 

In Fig. 10, an input-weight 16 critical codeword is shown 
for the conventional binary turbo code. The upper and lower 
constituent codewords contain 5 fundamental paths each. The 
interleaving of the systematic 1-positions is indicated by 
arrows and the 16 first letters of the English alphabet. The 
fundamental paths are of the following three types 

1) 0i^3^4^2i^^l^6^0 

2) oi^3^7^6^3^7^5^ 



4-!^2^1^6i^0 



(11) 



3)oi^3i^4i^2ini-^5^4^ 
2^1^6i^0 

where a state transition from state a to state b with input label 

X and output label yz is denoted by a ^^^^> b. To be more 
specific, for the upper constituent codeword, the first (from the 
left) fundamental path is of type 3), the second is of type 1), 
the third is of type 2), the fourth is of type 2), and the fifth 
is of type 1). For the lower constituent codeword, the first 
fundamental path is of type 1), the second is of type 2), the 
third is of type 2), the fourth is of type 1), and the fifth is of 
type 3). The overall weight of the codeword is at most 64. It 
is an upper bound, since some of the fundamental paths may 
overlap. To make the structure in Fig. 10 an actual codeword, 
the following conditions must hold 
a) f{x + 9) + l = figifix + 1) + 1) + 8) (mod K) 



b) fix + 9) + 5 = figifix + 1) + 5) + 8) (mod K) 

c) fix + 4) + 2 = figifix) + 2) + 4) (mod K) 

d) fix + 4) + 10 = figifix) + 10) + 4) (mod K) 

e) /(.g(/(x+l) + l)-2) + l = /(.g(/(a;)+2)-l) (mod X) 

f) figifi^ + 1) + 1) - 2) + 4 = figifix + 1) + 5) - 2) 
(mod K) 

g) figifi^ + 1) + 1) - 2) + 9 = figifix) + 10) - 1) 
(mod K) 

where x G 'Zk is the leftmost systematic 1 -position in the 
upper constituent codeword, gix) — gix + g2X^ (mod K) 
is the inverse of fix) ~ fix + f2x'^ (mod K), and fix) 
is the QPP that generates the turbo code interleaver These 
congruences reduce to 

a) 16/252(1 + 2fix + 1)) + 16/2.91 = (mod K) 

b) 80/252(5 + 2 fix + 1)) + 8O/251 = (mod K) 

c) 32/252(1 + fix)) + I6/251 = (mod K) 

d) 160/252(5 + fix)) + 8O/251 = (mod K) 

e) 4/252(^1 + 2/1 + 2/2 + 4/2.T) = (mod K) 

f) 32/252(3 + fix + 1)) + I6/251 = (mod K) 

g) I6/251 + 4/252(49 + 2(4/(a;) - /i - /2 - 2/2X)) = 
(mod K) 

If 27 1 K, then these congruences reduce further to 

a) I6/2 = (mod K) 

b) 8O/2 = (mod K) 

c) I6/2 = (mod K) 

d) 8O/2 = (mod K) 

e) = (mod K) 

f) I6/2 = (mod K) 

g) I6/2 = (mod K) 

since 4/252 = (mod K) [28, Theorem 3.5] when 27 \ K, 
and 51 is relatively prime with K, since 4| A'. Now, if I6/2 = 
(mod K), then all these congruences are satisfied, from which 
it follows that np^2 + 4 > nK,2- Using [28, Theorem 3.6], it 
follows that the inequality above is true for all values of /2 if 



nK,2 < 



nK,3 < 



nK,p < 



4 + max( ^^^^^-^ ,1) 



,4, 

max I 
nK,p 



( r "jc,3-i i 

V 2 ' V 



, 1 ) , if nK^2 > 1 

if nK,2 = 0,1 
, 1 ) , if 71^,3 > 
if riK.s = 



if p ^ 2, 3 



which reduces to 



nK,p < 



7, ifp = 2 
1, otherwise 



(12) 



which is one of the conditions stated in the theorem. We 
will now look at the first 1 -position for each of the upper 
fundamental paths in Fig. 10. Note that since A\K (the third 
assumption in the theorem), we can change the order of 
(mod K) and (mod 4), i.e., (a; (mod K)) (mod 4) — x 
(mod 4) for any integer x. Thus, the 1-positions are x, 
gifix) + 2)-l=x + 2gi-l imod 4), g{f{x + l) + 1) -2 = 
x - 1 + 51 + 52 + 252/(.i: + 1) (mod 4), gifix + 1) + 
5) - 2 = a; - 1 + 51 + 52 + 252/(x + 1) (mod 4), and 
gifix) + 10) - 1 = 2; + 251 - 1 (mod 4). With x = 1 
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(mod 4), we get the 1-positions 1 (mod 4), 2gi (mod 4), 
91 + 92 (mod 4), gi + 52 (mod 4), and 2gi (mod 4). 

For the second component we get the following. The 1- 
positions are /(a; + 9) = /(a; + 1) (mod 4), f{x + 4) = f{x) 
(mod 4), fix), fix + 1), and /(.g(/(a: + 1) + 1) - 2) = 
/(x)-/i+/2 + l + 2/2X (mod 4). With a; = 1 (mod 4), we 
get the 1-positions 2/i (mod 4), /i + /2 (mod 4), /i + /2 
(mod 4), 2/1 (mod 4), and 1 (mod 4). 

Note that gi + 92 = fi + /2 (mod 4) and 2/i = 2gi = 2 
(mod 4) when 4 | K. 

From the fundamental paths in (1 1), it follows that the input 
to the third encoder within the patch is the all-zero sequence 
when (71 + (72 = /i + /2 = 1 (mod 4). Note that the other 
case, i.e., when gi + 32 = /i + /2 = 3 (mod 4) (/i + /2 and 
91 + 92 are odd) does not give an all-zero input sequence. 

In Fig. 11, an input-weight 17 critical codeword is shown 
for the conventional binary turbo code. The upper and lower 
constituent codewords contain 5 fundamental paths each. The 
interleaving of the systematic 1-positions is indicated by 
arrows and the 17 first letters of the English alphabet. The 
fundamental paths are of the same type as those in Fig. 10. 
To be more specific, for the upper constituent codeword, the 
first (from the left) fundamental path is of type 3), the second 
is of type 2), the third is of type 3), the fourth is of type 1), 
and the fifth is of type 2). For the lower constituent codeword, 
the first fundamental path is of type 3), the second is of type 
2), the third is of type 1), the fourth is of type 3), and the fifth 
is of type 2). The overall weight of the codeword is at most 
67. To make the structure in Fig. 1 1 an actual codeword, the 
following conditions must hold 

a) gifix) - 8) + 4 = gifix + 4) - 8) (mod K) 

b) gifix) - 8) + 1 = gifix + 1) - 8) (mod K) 

c) gifix) - 8) + 9 = gifix + 9) - 8) (mod K) 

d) gifix + 9) - 9) - 8 = gifix + 1) - 9) (mod K) 

e) gifix + 9) - 9) - 10 = gifigifix) - 10) - 1) + 1) 
(mod K) 

f) gifix + 4) - 10) = gifix) - 10) + 4 (mod K) 

g) /(5(/(a^ + 1) - 5) - 2) = figifix) - 10) - 1) + 5 
(mod K) 

h) gifix + 9) - 5) - 8 = gifix + 1) - 5) (mod K) 
where x E Zk is the leftmost systematic 1-position in the 
upper constituent codeword. We can show that if 16(?2 = 
(mod K) and 27 1 K, then all these congruences are satisfied 
for any x € Ik, and it follows that no 2 + 4 > uk;!- Using 
[28, Theorem 3.6], we get the same conditions as in (12), 



which is one of the conditions stated in the theorem. 

We will now look at the first 1-position for each of the 
upper fundamental paths in Fig. 11. Note that since \\K (the 
third assumption in the theorem), we can change the order of 
(mod K) and (mod 4). Thus, the 1-positions are x, gifix + 
9) - 9) - 10 = x - 1 - .gi + 32 - 252/(2: + 1) (mod 4), 
gifix) -8)=x (mod 4), gifix) - 10) - 1 = x - 2.91 - 1 
(mod 4), and g(/(a: + l)-5)-2 = a;-l-.gi+.92-2.g2/(a; + 
1) (mod 4). With x = 1 (mod 4), we get the 1-positions 1 
(mod 4), 3gi + g^ (mod 4), 1 (mod 4), 2gi (mod 4), and 
3gi + 92 (mod 4). 

For the second component we get the following. The 1- 
positions are fix + 1) — 9 = fix + 1) — 1 (mod 4), /(.x + 
4)-10 = fix)-2 (mod 4), /(g(/(a;)-10)-l) = /(a;)-2- 
/i + /2-2/2a; (mod 4), /(a;+9)-9 = /(a;+l)-l (mod 4), 
and fix) - 10 = /(x) - 2 (mod 4). With x = 1 (mod 4), 
we get the 1-positions 2/i — 1 (mod 4), /i + /2 — 2 (mod 4), 
2 (mod 4), 2/1 - 1 (mod 4), and /i + /2 - 2 (mod 4). 

From the fundamental paths in (1 1), it follows that the input 
to the third encoder within the patch is the all-zero sequence 
when gi + 52 = /i + /2 = 3 (mod 4). Note that the other 
case, i.e., when gi + 52 = /i + /2 = 1 (mod 4), does not 
give an all-zero input sequence. 

The result of Theorem 2 follows by combining the results 
above, i.e., there is an upper bound of 64 (resp. 67) when 
gi + 52 = /i + /2 = 1 (mod 4) (resp. 51 + 32 = /i + /2 = 3 
(mod 4)). 

Appendix B 
Proof of Theorem 3 

In Fig. 12, an input-weight 9 critical codeword is shown 
for the conventional binary turbo code. The upper and lower 
constituent codewords contain 3 fundamental paths each. The 
interleaving of the systematic 1-positions is indicated by 
arrows and the 9 first letters of the English alphabet. The 
fundamental paths are all of type 1), and the overall weight of 
the codeword is at most 27. To make the structure in Fig. 12 
an actual codeword, the following conditions must hold 

a) gifix) - 5) + 1 = gifix + 1) - 5) (mod K) 

b) gifix) - 5) + 5 = gifix + 5) - 5) (mod K) 

c) gifix) - 4) + 1 = gifix + 1) - 4) (mod K) 

d) gifix) - 4) + 5 = gifix + 5) - 4) (mod K) 

where x £ "Lk is the leftmost systematic 1-position in the 
upper constituent codeword. We can show that if 2g2 = 
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(mod K), then all these congruences are satisfied. Further- 
more, the input to the third encoder within the patch is the 
all-zero sequence. Thus, if 2g2 = (mod K), then there is 
an upper bound of 27 on the minimum distance dmin- 

Reasoning in a similar manner with an input- weight 18 
codeword, we can show that the minimum distance rfmin is 
upper-bounded by 54 if 4^2 = (mod K). The details are 
omitted for brevity. 

Appendix C 
Proof of Lemma 2 

To prove the lemma, we need the following result. 

Lemma 3: If 4|i^ and xi = X2 (mod 4), then f{xi) = 
f{x2) (mod 4) for all xi,X2 G I^k and all QPPs f{x) = 
fix + j2X^ (mod K). 

Proof: Since 4|i^, we can change the order of (mod K) 
and (mod 4), i.e., {x (mod i^)) (mod 4) = a; (mod 4) for 
any integer x. Thus, 

. if x = (mod 4), then f(x) = 4i/i + 165^/2 
(mod K)=^ (mod 4), for some integer x. 

, lfx = l (mod 4), then /(x) = (4f + l)/i + (4.^ + 1)2/2 
(mod K) = fi + /2 (mod 4), for some integer x. Since 
/2 is even and /i is odd (see item 1) of Definition 1), 
f{x) (mod 4) is odd. 

. If a; = 2 (mod 4), then f{x) = (4i + 2)/i + (4i + 2)^/2 
(mod K) = 2/i (mod 4), for some integer x. Further- 
more, since fi is odd (see item 1) of Definition 1), 
f{x) = 2 (mod 4). 

. If a; = 3 (mod 4), then f{x) = (4S-F3)/i-|-(4i-|- 3)^/2 
(mod K) = 3/1+/2 (mod 4), for some integer x. Since 
/2 is even and /i is odd (see item 1) of Definition 1), 
f{x) (mod 4) is odd. Also, 3/1-I-/2 ^ /1-I-/2 (mod 4), 
and the result follows. 

■ 

The bound in Theorem 2 is based on the upper and lower 
constituent codewords in Figs. 10 and 11. For details, see the 
proof of Theorem 2 in Appendix A. Now, consider the code- 
word in Fig. 10. Let x G Zk denote the leftmost systematic 
1-position in the upper constituent codeword. As shown in the 
proof of Theorem 2, we require x = 1 (mod 4). Also, for a 
given value of x such that x = 1 (mod 4), the fundamental 
paths in Fig. 10 may wrap around at the end of trellis. Since 
all the systematic 1-positions are determined by the value of x, 
there will be exactly Li — 1 values for x that will make the ith, 
i ~ 0, . . . , Q — 1, fundamental path wrap around at the end of 
the trellis, where Li is the length of the ith fundamental path 
and Q is the total number of fundamental paths in the upper 
and lower constituent codewords. By repeating the argument, 
we get that there will be at most L — Q values for x that will 
make at least one of the fundamental paths wrap around at 
the end of the trellis, where L = Y^f^Q^ Li. Since if xi = X2 
(mod 4), then f{xi) = f{x2) (mod 4) for any xi,X2 € Zk 
and any QPP f{x) = f\x + f2x'^ (mod K) (see Lemma 3 
above), the condition that a fundamental path should not wrap 
around at the end of the trelUs, will remove at most one more 
value of a; = 1 (mod 4) than any other value for x. Thus, 
we get the condition K - {L - Q) > 3{Q + 1) + 1, which 



simplifies to K > L + 2Q + 4 = 112, since, for the codeword 
in Fig. 10, g = 10 and L = 88. 

Note that the argument above can be repeated for the 
codeword in Fig. 11, and we get the numerical value of 
K > 120. Thus, if is: > max(112, 120) = 120, there will 
exist at least one value for x such that a; = 1 (mod 4) and 
such that none of the fundamental paths in Figs. 10 and 1 1 will 
wrap around at the end of the trellis. Furthermore, since there 
is a finite number of QPPs with a quadratic inverse for each 
value of K in the range [33, 119] and such that A\K and the 
conditions in (10) are satisfied, it can be numerically checked 
that there will always exist, for any QPP f{x) = fix + f2X^ 
(mod K) and its inverse g{x) = g\x+g2x'^ (mod K), at least 
one value for x such that a; = 1 (mod 4) and such that none of 
the fundamental paths in Fig. 10 (when /i + /2 = gi + ,92 = 1 
(mod 4)) and in Fig. 1 1 (when /i + /2 = gi + .92 = 3 
(mod 4)) will wrap around at the end of the trelUs. Also, 
for K = 32, this is not the case, and Leimna 2 is proved. 

Acknowledgment 

The authors would like to thank the anonymous reviewers 
for their helpful connments and suggestions. 

References 

[1] C. Berrou, A. Glavieux, and P. Thitimajshima, "Near Shannon limit 
error-correcting coding and decoding: Turbo-codes," in Proc. IEEE 
Int. Conf. Commun. (ICC), Geneva, Switzerland, May 1993, pp. 1064- 
1070. 

[2] C. Berrou, A. Graell i Amat, Y. Ould-Cheikh-Mouhamedou, C. Douil- 
lard, and Y. Saouter, "Adding a rate- 1 third dimension to turbo codes," 
in Proc. IEEE Inf. Theory Workshop, Lake Tahoe, CA, Sep. 2007, pp. 
156-161. 

[3] C. Berrou, A. Graell i Amat, Y. Ould-Cheikh-Mouhamedou, and 
Y. Saouter, "Improving the distance properties of turbo codes using a 
third component code: 3D turbo codes," IEEE Trans. Commun., vol. 57, 
no. 9, pp. 2505-2509, Sep. 2009. 

[4] J. Li, K. R. Narayanan, and C. N. Georghiades, "Product accumulate 
codes: A class of codes with near-capacity performance and low 
decoding complexity," IEEE Trans. Inf. Theory, vol. 50, no. I, pp. 31— 
46, Jan. 2004. 

[5] H. Gonzalez, C. Berrou, and S. Kerouedan, "Serial/parallel (s/p) turbo 

codes for low error rates," in Proc. IEEE Int. Conf Commun. (ICC), 

vol. 1, Paris, France, Jun. 2004, pp. 346-350. 
[6] A. Perotti and S. Benedetto, "A new upper bound on the minimum 

distance of turbo codes," IEEE Trans. Inf. Theory, vol. 50, no. 12, pp. 

2985-2997, Dec. 2004. 
[7] , "An upper bound on the minimum distance of serially concate- 
nated convolutional codes," IEEE Trans. Inf. Theory, vol. 52, no. 12, 

pp. 5501-5509, Dec. 2006. 
[8] H. D. Pfister and P. H. Siegel, "The serial concatenation of rate-1 codes 

through uniform random interleavers," IEEE Trans. Inf. Theory, vol. 49, 

no. 6, pp. 1425-1438, Jun. 2003. 
[9] C. Ravazzi and F. Fagnani, "Spectra and minimum distances of repeat 

multiple-accumulate codes," IEEE Trans. Inf. Theory, vol. 55, no. 11, 

pp. 4905^924, Nov. 2009. 
[10] H. D. Pfister, "On the capacity of finite state channels and the analysis 

of convolutional accumulate-m codes," Ph.D. dissertation, University 

of California San Diego, San Diego, CA, 2003. [Online]. Available: 

http://www.ece.tamu.edu/~hpfister/research.html 
[11] R. G. Gallager, Low-Density Parity-Check Codes. Cambridge, MA: 

MIT Press, 1963. 

[12] J. Kliewer, K. S. Zigangirov, and D. J. CosteUo, Jr., "New results on the 
minimum distance of repeat multiple accumulate codes," in Proc. 45th 
Annual Allerton Conf. Commun., Control, and Computing, Monticello, 
IL, Sep. 2007. 

[13] C. Roller, A. Graell i Amat, J. Kliewer, F. Vatta, and D. J. CosteUo, 
Jr., "Hybrid concatenated codes with asymptotically good distance 
growth," in Proc. 5th Int. Symp. Turbo Codes & Rel. Topics, Lausanne, 
Switzerland, Sep. 2008, pp. 19-24. 



14 



[14] , "Tuned turbo codes," in Proc. Int. Symp. Inf. Theory and its Appl. 

(ISITA), Auckland, New Zealand, Dec. 2()08, pp. 1367-1372. 
[15] I. Sason, E. Telatar, and R. Urbanke, "On the asymptotic input-output 

weight distributions and thresholds of convolutional and turbo-like 

encoders," IEEE Trans. Inf. Theory, vol. 48, no. 12, pp. 3052-3061, 

Dec. 2002. 

[16] S. Crozier and P. Guinand, "Distance upper bounds and true minimum 
distance results for turbo-codes designed with DRP interleavers," in 
Proc. 3rd Int. Symp. Turbo Codes & Rel. Topics, Brest, France, Sep. 
2003, pp. 169-172. 

[17] S. Crozier, P. Guinand, and A. Hunt, "Computing the minimum distance 
of turbo-codes using iterative decoding techniques," in Proc. 22th 
Biennial Symp. Commun., Kingston, ON, Canada, May/Jun. 2004, pp. 
306-308. 

[18] C. Berrou, S. Kerouedan, Y. Saouter, C. Douillard, and M. Jezequel, 
"Designing good permutations for turbo codes: Towards a single model," 
in Proc. IEEE Int. Conf. Commun. (ICC), vol. 1, Paris, France, Jvm. 2004, 
pp. 341-345. 

[19] J. Sun and O. Y. Takeshita, "Interleavers for turbo codes using permu- 
tation polynomials over integer rings," IEEE Trans. Inf. Theory, vol. 51, 
no. 1, pp. 101-119, Jan. 2005. 

[20] E. Rosnes and O. Y. Takeshita, "Optimum distance quadratic permu- 
tation polynomial-based interleavers for turbo codes," in Proc. IEEE 
Int. Symp. Inf Theory (ISIT), Seattle, WA, Jul. 2006, pp. 1988-1992. 

[21] O. Y. Takeshita, "Permutation polynomial interleavers: An algebraic- 
geometric perspective," IEEE Tram. Inf. Theory, vol. 53, no. 6, pp. 
2116-2132, Jun. 2007. 

[22] A. Nimbalker, T. K. Blankenship, B. Classon, T. E. Fuja, and 
D. J. Costello, Jr., "Contention-free interleavers," in Proc. IEEE 
Int. Symp. Inf Theory (ISIT), Chicago, XL, Jun./Jul. 2004, p. 54. 



[23] T. K. Blankenship, B. Classon, and V. Desai, "High-throughput turbo 
decoding techniques for 4G," in Proc. Int. Conf. 3G Wireless and 
Beyond, San Francisco, CA, May 2002, pp. 137-142. 

[24] O. Y. Takeshita, "On maximum contention-free interleavers and permu- 
tation polynomials over integer rings," IEEE Trans. Inf. Theory, vol. 52, 
no. 3, pp. 1249-1253, Mar. 2006. 

[25] Multiplexing and Channel Coding (FDD), 3rd Generation Partnership 
Project, Jun. 1999, 3G TS 25.212. 

[26] S. Benedetto and G. Montorsi, "Unveiling turbo codes: Some results on 
parallel concatenated coding schemes," IEEE Trans. Inf. Theory, vol. 42, 
no. 2, pp. 409-428, Mar. 1996. 

[27] D. Divsalar, "A simple tight bound on error probability of block codes 
with application to turbo codes," Nov. 1999, TMO Progress Report 42- 
139, JPL, Pasadena, CA. 

[28] J. Ryu and O. Y. Takeshita, "On quadratic inverses for quadratic 
permutation polynomials over integer rings," IEEE Trans. Inf Theory, 
vol. 52, no. 3, pp. 1254-1260, Mar. 2006. 

[29] E. Rosnes, "Further results on quadratic permutation polynomial-based 
interleavers for turbo codes," Department of Informatics, University of 
Bergen, Norway, Tech. Report no. 396, Feb. 2011. [Online]. Available: 
http://www.ii.uib.no/publikasjoner/texrap/pdf/201 l-396.pdf 

[30] P. Guinand and J. Lodge, "Trellis termination for turbo encoders," in 
Proc. I7th Biennial Symp. Commun., Kingston, ON, Canada, May/Jun. 
1994, pp. 389-392. 

[31] S. ten Brink, "Convergence behaviour of iteratively decoded parallel 
concatenated codes," IEEE Trans. Commun., vol. 49, no. 10, pp. 1727- 
1737, Oct. 2001. 

[32] A. Graell i Amat and E. Rosnes, "Analysis of 3-dimensional turbo code 
ensembles," in Proc. IEEE Inf. Theory Workshop, Cairo, Egypt, Jan. 
2010, pp. 46-50. 



